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Phase transition in the Countdown problem 
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Here we present a combinatorial decision problem, inspired by the celebrated quiz show called 
the countdown, that involves the computation of a given target number T from a set of k randomly 
chosen integers along with a set of arithmetic operations. We find that the probability of winning the 
game evidences a threshold phenomenon that can be understood in the terms of an algorithmic phase 
transition as a function of the set size k. Numerical simulations show that such probability sharply 
transitions from zero to one at some critical value of the control parameter, hence separating the 
algorithm's parameter space in different phases. We also find that the system is maximally efficient 
close to the critical point. We then derive analytical expressions that match the numerical results 
for finite size and permit us to extrapolate the behavior in the thermodynamic limit. 
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In combinatorial optimization problems, a large 
amount of literature points out to the occurrence of a 
so called threshold phenomenon in the performance of 
search algorithms [1-4]: there exists a phase in param- 
eter space where the search algorithm can easily find a 
solution to the aforementioned combinatorial problem 
(as the number of available solutions is exponentially 
large with the system size), and a phase where such 
solution typically (i.e. almost surely) does not exist. 
The transition between both phases is sharp in some 
situations, mimicking in several aspects the phenomenon 
of a phase transition in statistical physics problems. 
Some classical problems evidencing such phenomenon 
include combinatorial problems in random graphs or 
the satisfaction of (random) boolean clauses, generically 
gathered under the umbrella of random constraint 
satisfaction problems (rCSP) [3, H[. Many of these 
concrete problems can indeed by interpreted under a 
statistical physics formalism @, [tJ, the general idea 
being the following: in a combinatorial optimization 
problem, in some cases one can formalize a cost function 
to be minimized. In satisfaction problems, this is for 
instance the number of violated constraints. Within 
statistical physics of disorders systems, such as in spin 
glasses, one indeed proceeds in the same manner if the 
system is studied in the limit of low temperature (in that 
situation, the system tries to adopt the ground state 
or minimal energy configuration). Thereby, the cost 
function of a combinatorial optimization problem can be 
related to the Hamiltonian of a disordered system at zero 
temperature (for instance, finding a minimum partition 
within the so-called partitioning problem is equivalent 
to finding the ground state of an infinite range Ising spin 
glass with Mattis-like, antiferromagnetic couplings [7[). 
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In this work we present a random decision problem, 
called the countdown problem, which is inspired in 
a celebrated british TV quiz show called Countdown 
(based itself on the French game show Des chiffres et 
des lettres, one of the longest-running game shows in the 
world, and receiving other names in several countries 
Q). This show is separated in several games, one of 
which incorporates the combinatorial problem of arith- 
metically combining some numbers to produce another 
one. Concretely, the contestants must use arithmetics 
to reach a given target number from six other numbers 
used each of them at much once. Here we formalize a 
random version of this decision problem and explore its 
solvability as a function of the parameter space. We 
will provide numerical evidence according to which the 
solvability of the decision problem shows the presence 
of an algorithmic phase transition, and will introduce 
an approximate theory, which we show to be on good 
agreement with the numerics for finite size systems and 
permits us to theoretically find the phase transition in 
the thermodynamic limit. 



Let us begin by defining the pool of size M as the 
integer interval [1,M]. Suppose that we randomly ex- 
tract with reposition from this pool a set of k integers 
X = {xi,X2, Xk} and another integer T called the tar- 
get. Stated as a decision problem, the target game raises 
the following question: for a given duple (k, M), which is 
the probability P(k, M) of reaching T by combining the 
elements of X (where each element Xi can be used or not, 
but each of them will be used at much once) through the 
set of arithmetic operations A = {+, — , x, -=-}? Once M 
is fixed, one can assert that for rather small values of k, as 
k = 2 for example, the amount of possible combinations 
is rather limited. For large values of k, the situation is the 
opposite: on average there will exist many possible ways 
of combining the elements in X to reach T. Whereas sev- 
eral parallels with random k-satisfiability can be outlined, 
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it can be shown that the problem at hands is not based on 
finding the correct variables assignment X, but the cor- 
rect Hamiltonian assignment H amongst a Hamiltonian 
ensemble . While this dual representation disables the 
possibility a standard statistical mechanics approach, we 
will take advantage of the number theoretical nature of 
the problem to propose a probabilistic theoretical treat- 
ment. We start be relaxing the problem statement by 
assuming that the set of available arithmetic operations 
is restricted to A ={+,—}• As the problem is computa- 
tionally nontrivial 10], in order to explore the problem 
numerically, we have implemented a brute force recursive 
routine that explores the search space in an exhaustive 
way (additional details in For a given pool size M, 
we fix k, make Monte Carlo simulations and perform en- 
semble averages over different realizations of T and X. 
The winning probability P(k, M) is defined as the prob- 
ability of reaching T by arithmetically combining at much 
the k numbers (where each of the k elements can be used 
or not, but each of them will be used at much once). In 
figured] we plot the results of P(k,M) vs k for different 
pool sizes M, averaged in each case over 10 4 realizations. 
Note that the transition between P ~ (loosing almost 
surely) and P ~ 1 (winning almost surely) is sharp and 
occurs at a certain k c (M), estimated as the linear in- 
terpolation of k for which P(k c , M) — 0.5, as usual in 
percolation theory. This crossover value depends on the 
pool size, since the control parameter k is not intensive. 
In the inset panel of figure [T] we plot, in linear-log scales, 
its dependence with system's size, finding a logarithmic 
scaling of the form k c (M) = alog(M) + b, with a = 0.98 
and b = 0.31. 

Our analytical treatment deals with the estimation of 
P(k,M), when k << M, and for that task we begin by 
considering a single operation, the sum. If we choose at 
random n numbers from [1, M], the largest possible value 
of its sum is nM. In a first approximation, we will sup- 
pose that the sum of n numbers is uniformly distributed 
in [1, nM]. The result of this sum will fall in [1, M] with 
probability 1/n. The amount of results in [1,M] accessi- 
ble from k numbers chosen at random from that interval 
is therefore T*. i( fc ). 

Let us proceed forward and introduce a second operation 
in our system: substraction. Incorporating this operation 
is approximately analogous to handle the former system, 
where now each number a from the n numbers chosen at 
random, represents both a and —a. Now, the result of 
summing up the 2n numbers will belong to [— nM, nM], 
such that, assuming uniformity once again, it will fall 
in [1,M] with probability l/2n. Proceeding as before, 
the amount of results in [1,M] accessible from k num- 
bers chosen at random from that interval, N{k), can be 
written now as 

N (k)=f: -if 2 *^ 2 "-*- 2 . a) 

v ' ^ In V n I 2k + 1 w 

n— 1 v / 

The third approximation consists in assuming that those 
N byproducts are indeed random independent trials of 
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FIG. 1: Winning probability P(k, M) curve as a function of k, 
for increasing pool sizes M, in the system with A = {+, — }. 
Solid curves are the analytical predictions (equations I2I4[) . 
(Inset panel) Linear- log plot of the crossover value scaling 
k c {M), for different values of the pool size M. In each case the 
crossover value is estimated as the linear interpolation of k for 
which P(k c ,M) = 0.5, as in percolation theory. The straight 
line is a fit to a logarithmic function fc c (M) = alog(M) + b 
with a = 0.98 and b = 0.31. 

finding the target in [1,M]. In that situation, the win- 
ning probability P(k,M) reads 

/ 1 \N(k) N{k) 

P(k,M) = l-(l-_J «l-e-— (2) 

for M » 1. This result works qualitatively, however it 
can be improved, taking equation Q] up to leading order 
and introducing dependency on M through a modulat- 
ing factor r(M) that quantifies the correlations amongst 
numbers, such that 

e r(M)k 

N ( k)M ) = — j —. (3) 

In order to estimate r(M), note that k c (M) is defined 
such that P(k c , M) = 1/2. From equation [5] we find 

r(M) = l0g ^ l0g2 >. (4) 

In figure [T] we plot, in solid lines, the theoretical values of 
P(k, M) as a result of equations [2]|4l with the appropri- 
ate scaling k c (M) previously reported, which show good 
agreement with the numerics. In figure [5] we extend the 
problem to the more general case where all elementary 
arithmetic operations are allowed, A = {+, — ,X,H-}. 
The numerical results are analogous to the simpler case, 
where now the finite size scaling of the critical point ful- 
fills k c (M) = a log M + b with a = 0.84 and b = 0.39. 
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FIG. 2: Similar numerical results as figure \T\ in the general 
case of A — {+, — , x, -7-}. The scaling of the critical point is 
again logarithmic k c (M) — alog(Af) + b, with a = 0.84 and 
b = 0.39, what shifts the critical point towards smaller values 
of k. Solid lines are the results of the theory (equations OH) 
with the latter scaling relation. 



Solid lines are the predictions of the theory with the lat- 
ter scaling, showing again good agreement with the nu- 
merical simulations. 

Introducing the intensive control parameter a — k/k c , 
in figure [3] we plot the numerical curves P(a,M) result- 
ing from the simulations performed in the general case. 
As the system size increases, the probability of satisfying 
the game gets sharper around the now size independent 
crossover value. The behavior in the thermodynamic 
limit (M — > oo, a finite) can be derived from equation 
[2HH finding a Heaviside step function 



Poo(&) = lim 1 — e 

M->oo 



if a < 1 

1 if a > 1 



(5) 



i.e. the onset of a true phase transition. 
Finally, let us define the function Q(k, M) that measures 
the system's efficiency as the average amount of potential 
targets in [1,M] that can be reached per unit number k. 
This function can be written as 



Q(k,M) 



P{k,M)M 



(6) 



whose behavior is shown in figure SJ This measure 
reaches a maximum in a neighborhood of k c (M), 
such that in the thermodynamic limit it diverges for 
a — > 1, deepening on the conjecture that states that the 
complexity of multicomponent systems is maximized 
close to their critical points [IJ [HI- This behavior 
is also related to the so called easy-hard-easy pattern 
that takes place in algorithmic phase transitions, and 
suggests that close to the critical point (hard phase) 



FIG. 3: Winning probability P(a, M) curve as a function 
of the intensive control parameter a — k/k c (M), where 
k c (M) — alog(M) + b is a scaling function of the crossover 
value with size (fitting values are a = 0.84, b — 0.39) accord- 
ing to figure [5] for increasing system sizes. The curves get 
sharper as M increases, pointing out to a phase transition in 
the thermodynamic limit: the onset of the so called thresh- 
old phenomenon. The Heaviside function, only reached in the 
thermodynamic limit, is a result of the theory (see the text). 



the computation time that the algorithm needs to come 
along with a solution is maximized: for a < I, the 
algorithm easily finds that the problem is unsatisfiable, 
whereas for a > 1 the algorithm easily finds one of 
many solutions, when a ~ 1 the number of solutions 
per unit number is optimal, and the algorithm needs 
to perform an exhaustive search of the whole space to 
find it. Note that within the quiz show Countdown, the 
pool is bounded to M = 1000. Interestingly enough, the 
contestants are allowed to make use of k = 6 numbers, 
what corresponds, according to our previous analysis, 
to the threshold between almost surely unsolvable to 
almost surely solvable instances. Driving the system 
towards the critical point assures that the game is hard 
but typically solvable, that is, interesting. 

To summarize, in this work we have presented a ran- 
dom combinatorial decision problem that evidences an 
algorithmic phase transition separating the parameter 
space where the problem is either almost surely satisfi- 
able or unsatisfiable. This work deepens on the relations 
between number theory and theoretical physics, whose 
interface has shown to be a potential breeding ground 
for new approaches in both areas. Some final re- 
marks can be outlined: first, note that while in physi- 
cal phase transitions the finite size scaling is usually in 
the form of a power law shape k c (M) ~ M a with some 
finite size exponent a, in this problem we find logarith- 
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FIG. 4: Efficiency measure Q(k, M) curve as a function of 
k, for increasing pool sizes M, in the system with A = 
{+, — , X,-:-}. Solid curves are the analytical predictions 
(equation [6]). This measure is maximized near the transition 
point. 



der to recover the usual power law scaling with system 
size k c (M) ~ M a , and make this parameter intensive (a 
temperature) through a — k/M a . On the other hand, we 
note that logarithmic scalings have been found previously 
in other number-theoretic systems evidencing collective 
phenomena [l4T - [l6l |. 

Finally, while the transition between unsatisfiable and 
satisfiable phases (loosing/winning) occurs at lower val- 
ues in the general case A = {+,— , x,-^} than in the 
simple one A = {+,—}, in both situations the sys- 
tem evidences the threshold phenomenon. Is this phe- 
nomenon only restricted to elementary arithmetic sys- 
tems or, much on the contrary, is this a fundamental 
behavior in abstract algebraic structures defined as a set 
of elements with some binary operations? On this re- 
spect, notice that the logarithmic scaling k c (M) can be 
interpreted as the growth rate of the minimal amount of 
elements needed to cover a growing system through bi- 
nary operations, since the amount of possible outcomes 
of combined binary operations grows exponentially fast. 
This is a challenging open question for future research, 
which could be addressed within combinatorial group 
theory fll\ . 
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